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ENCODING AND DECODING APPARATUS AND METHOD 
WITH HAMMING WEIGHT ENHANCEMENT 

INCORPORATION BY REFERENCE OF RELATED APPLICATION 
[0001] The present application is related to U.S. Provisional Patent Application Serial 
No. 60/273,511, entitled "Hamming Weight Encoding Scheme With Permutation 
Relative To ECC," filed March 7, 2001, which names as inventors Pantas Sutardja and 
Toai Doan (the inventors of the present invention), and which is hereby incorporated 
herein by reference. 

BACKGROUND 

1. Field of the Invention 

[0002] The present invention relates generally to encoding and decoding for data 
transfer and, more particularly, to a method and apparatus for use in encoding and/or data 
by providing for a Hamming weight enhancement to the data. 

2. Related Art 

[0003] Data is frequently stored and/or communicated in binary form, that is, as a 
sequence of binary digits or bits having zero and one values. When such data is 
communicated or simply transferred to a storage medium, such as occurs in the operation 
of a disk drive, for example, there is a substantial risk of transmission errors (e.g., a one- 
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valued bit being received when a zero-valued bit was transmitted or vice-versa) caused 
by noise in the communication channel and other factors. These transmission errors are 
especially significant when data compression techniques are used to reduce the number of 
bits needed to communicate a particular message (e.g., in a disk drive or, more generally, 
in a communication channel), because a single erroneous bit in a compressed message 
can result in corruption of a larger amount of information in the message after the 
message is decompressed. Disk drives, of course, are well-known in the art, but 
additional information and a general circuit arrangement for one exemplary disk drive 
apparatus may be found, for example, in Wakamatsu U.S. Patent No. 601 1,666, issued 
January 4, 2000, the disclosure of which is hereby incorporated herein by reference as if 
set forth herein in its entirety. 

[0004] Many different approaches to overcoming this problem with transmission errors 
have been developed. FIG, 1 illustrates one example of a prior-art write-channel encoder 
apparatus for encoding information for communication via a channel (or for storage on 
any suitable media). As shown, an encoder 18 receives user data 20 and sequentially 
performs error correction coding or ECC (block 22), run length limit or RLL encoding 
(block 24), and precoding (block 26) on the user data 20 to thereby produce encoded data 
which is then written to any suitable media and/or any suitable communication channel 
(block 28). 
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[0005] An inverse process for decoding the information encoded by the encoder 18 of 
FIG, 1 is performed by a prior-art read-channel decoding apparatus such as the decoder 
29 illustrated in FIG. 2. Initially, the decoder 29 reads or retrieves encoded information 
from a media source or a communication channel (block 30) and performs a maximum 
likelihood sequence detection (MLSD) on that encoded information (block 32). As will 
be appreciated by those of ordinary skill in the art, this MLSD detection compares the 
retrieved encoded information with a set of all possible information sequences and 
determines which of the possible sequences most likely represents the encoded 
information based on which of the possible information sequences is "closest" {e.g., by 
computing a predetermined error quantification metric for each possible sequence and 
selecting that sequence which has the smallest or minimum value for the computed error 
quantification metric). 

[0006] The read-channel decoder 29 then performs reversed precoding on the detected 
data sequence (block 34) followed by RLL decoding (block 36) and ECC decoding 
(block 38) to thereby derive user data 40, which corresponds to the user data 20 encoded 
by the encoder 18 (FIG. 1). 

[0007] In this approach, the RLL decoding step in the data recovery process 
undesirably causes small errors to be propagated into many symbols of error in the 
decoded message. As will be appreciated by those of ordinary skill in the art, the number 
of symbol errors that can be fixed by the error correction coding (ECC) is limited by 
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design. Consequently, the RLL decoding used in this approach degrades the final error 
rate of the data recovery process. 

[0008] One solution to this problem has been to reduce the size of the actual encoded 
word to minimize the maximum possible size for error propagation. In the most extreme 
case, only one symbol is encoded, and the rest of the symbols in the code word are left 
unencoded. 

[0009] By way of example, where a communication scheme employs an eight-bit 
symbol size and an unencoded code word made up of 4 symbols {ul, u2, u3, u4}, the 
best error correction is obtained by simply encoding one of the symbols into a 9-bit code 
word. For example, if ul is encoded to produce a 9-bit code word El a final code with 
low error-propagation can be constructed as follows: 

final code = {u2, ela, u3, elb, u4} 
where ela is the first (most significant) 4 bits of the code word El and elb is the last 
(least significant) 5 bits of the code word El. In such a scheme, following the decoding 
process, errors of small size (i.e., over 2 bits more than the smaller of ela or elb) can 
corrupt at most two symbols, which may occur when an error encompasses an unencoded 
symbol and one of the partially encoded symbols. 

[0010] The advantage of this kind of simple encoding scheme is in the reduced error 
propagation after RLL decoding and thus the total number of symbols that can be 
corrupted by a single error event. As a consequence, this encoding scheme allows for an 
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increase in the total number of error events the error correction coding (ECC) can correct. 
A significant disadvantage of this scheme is that the smaller encoded symbol makes the 
minimum Hamming weight of the total code word very small (i.e., equal to the Hamming 
weight of the single encoded symbol). As will be appreciated by those of ordinary skill 
in the art, the Hamming weight of a code word or other collection of symbols is defined 
as the number of non-zero symbols in the code word or collection of symbols in the 
Interleaved Non-Return to Zero Inverting (INRZI ) domain. For binary signaling, the 
Hamming weight of a bit stream is the number of "1" bits in the bit stream. The read- 
channel device (i.e., the decoder) performs numerous functions, such as timing and gain 
recovery, before it passes sampled data to the MLSD detector. The purpose of RLL 
encoding is not only to remove quasi-catastrophic sequences that can degrade the 
performance of the MLSD, it is also used to introduce a guaranteed minimum number of 
signal transitions to allow the gain and timing loops to operate. Codes with high 
Hamming weight typically provide more signal transitions that can be used for the timing 
and gain loops. Codes with very low Hamming weight may not provide adequate timing 
and gain gradients for the channel to provide correct sampling phase and gain control. 

[0011] This encoding scheme is least effective when every encoded symbol in the 
entire encoded data field has minimum Hamming weight and all the other unencoded 
symbols are zeros. The degradation in the timing and gain loops as a result of the 
channel operating on a sequence of low Hamming weight codes can be so large that it 
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negates the benefit of the reduced error propagation made possible by this encoding 
scheme. 

[0012] To mitigate this problem, the storage industry has used data-scrambling 
technique to minimize the probability of creating a large number of zeros in the user 
symbols. Unfortunately, this technique merely makes it more difficult for an end-user to 
locate a bad data sequence. There is still some finite probability that the bad data 
sequence can be created accidentally by the scrambling of some random data (e.g., 
random data created by running a string of zero-valued bits through the same randomizer 
used in the channel). This randomizer technique is therefore less than satisfactory. 

[0013] FIG, 3 illustrates an alternative embodiment of a prior-art write-channel 
encoder apparatus 41, which is similar to the encoder apparatus 18 of FIG. 1, except that 
the order in which the error correction coding (ECC) and the run length limit (RLL) 
encoding are performed is reversed in the encoding apparatus 41 of FIG. 3. In other 
words, as shown, run length limit encoding is performed on user data 42 (block 44), and 
error correction coding is subsequently performed (block 46). The result then undergoes 
precoding (block 48) and the encoded information is then written to any suitable media 
and/or a communication channel (block 50). 

[0014] Similarly, FIG. 4 illustrates a read-channel decoding apparatus 5 1 which reads 
encoded information from a media or a communication channel (block 52) performs 
MLSD detection (block 54), reversed precoding (block 56), error correction code (ECC) 
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decoding (block 58), and run length limit (RLL) decoding (block 60) to generate user 
data 62, which corresponds to the user data 42 processed by the encoding apparatus 41 
(FIG. 3). 

[0015] Permuting or reversing the ECC and RLL encoders in the "write" process in this 
fashion serves to prevent or at least reduce error propagation. In such an encoding 
scheme, the ECC would correct on the data before RLL decoding, thereby completely 
avoiding the error propagation effect of RLL decoding. The output after ECC correction 
is assumed to be perfect user data, such that the RLL decoding following the ECC will 
not involve any error propagation. This method of recording and data recovery allows 
RLL code to be designed in such a way as to optimize for Hamming weights and thereby 
improve the performance of the timing and gain loops of the channel. Thus, the error 
propagation effect of the previously described encoding scheme are obviated. 

[0016] While this latter encoding scheme does provide at least a theoretical 
improvement in terms of reduced error propagation compared to the encoding scheme 
described above, it also has certain drawbacks due to compatibility issues that arise from 
the way in which storage devices that employ these encoding schemes are traditionally 
certified. Traditionally, technical customers of mass storage device often test the ECC 
capability of a given mass storage device before deciding to purchase the device. A 
common method of performing such a test involves so-called "read-long" and "write- 
long" processes. 
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[0017] In a read-long process, the drive or other device is instructed to read a certain 
data sector and return all the user symbols in that sector, plus the included ECC symbols, 
to the host. The host then runs a program that deliberately corrupts some number of bits 
in the received data (i.e., the user symbols and ECC symbols) and returns the corrupted 
data back to the drive using a write-long command. The write-long command instructs 
the drive to write all the long data back to the disk without additional ECC. In a 
subsequent normal read, the data is read back to the host and the host checks whether the 
ECC during a normal read has corrected the error that the host had deliberately 
introduced. This process is done repeatedly using long data corrupted at different 
location with different numbers of group of corrupted bits and different numbers of 
corrupted bits per group. These read-long and write-long instructions provide a means 
for the technical user to verify that the drive indeed has the specified error correction 
capability before deciding to purchase the drive. 

[001 8] A problem with the permuted ECC/RLL scheme is that one corrupted symbol 
can generate a large number of encoded symbols that are corrupted, assuming that the 
RLL encoder used is the one that is optimized for the timing and gain loops of the 
channel. Thus, the conventional read-long and write-long test method will produce a 
result that appears to suggest that the ECC performance of the drive is vastly inferior to 
what is claimed by the manufacturer, and a technical customer may decline to purchase 
the drive because of this adverse test result. This, in turn, will cause the manufacturer of 
the drive to lose market share to other manufacturers that do not incorporate ECC/RLL 
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permutation. For this reason, the permuted ECC/RLL scheme may not be readily 
accepted in the storage industry. 

SUMMARY 

[0019] The present invention relates to a method and apparatus for encoding and/ or 
decoding information for storage or communication via a channel with greater reliability 
than prior methods such as those described above. More particularly, the present 
invention employs an enhancement to the Hamming weight of data prior to encoding to 
increase the effectiveness of the ECC and RLL encoding processes. 

[0020] According to one aspect of the present invention, initial binary data to be 
communicated or stored is obtained, and the characteristic Hamming weight of the initial 
binary data is determined. The characteristic Hamming weight of the initial binary data 
is then compared with a predetermined value, and the initial binary data is processed 
based on the comparison to thereby develop processed binary data having a Hamming 
weight not less than the characteristic Hamming weight of the initial binary data. The 
characteristic Hamming weight of the initial binary data preferably is determined by 
counting one-valued bits in the initial binary data, and the predetermined value preferably 
is a predetermined minimum Hamming weight threshold value. 

[0021 ] Processing of the initial binary data may comprise bitwise inverting of the initial 
binary data if the Hamming weight of the initial binary data is less than the 
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predetermined value and may further comprise supplying an indication of whether the 
Hamming weight of the initial binary data is less than the predetermined value and/or an 
indication of whether bits of the processed binary data are inverted. Such an indication 
can comprise a binary digit having a first value if the bits of the processed binary data are 
inverted and having a second value otherwise. In some embodiments, the indication 
comprises a binary digit having a first value if the Hamming weight of the initial binary 
data is less than the predetermined value and having a second value otherwise. 
Processing of the initial binary data may also comprise performing at least one of error 
correction coding, run-length encoding, and precoding. 

[0022] According to another aspect of the invention, communication encoding is 
performed by a method wherein a first sequence of binary digits that collectively have a 
characteristic Hamming weight is obtained and each of the binary digits in the first 
sequence of binary digits is inverted if the Hamming weight of the first sequence of 
binary digits is below a predetermined threshold Hamming weight value. An indication 
is provided of whether the binary digits in the first sequence of binary digits have been 
inverted. 

[0023] According to yet another aspect of the present invention, a bitstream having a 
characteristic Hamming weight is communicated to a destination via a channel by 
determining the characteristic Hamming weight of the bitstream, inverting each bit in the 
bitstream if the characteristic Hamming weight of the bitstream is below a threshold 
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value and developing an indication of whether the bits in the bitstream are inverted, and 
delivering the bitstream and the indication of whether the bits in the bitstream are 
inverted to the destination via the channel. Each bit in the bitstream is then inverted at 
the destination if the indication indicates that the bits are inverted. 

[0024] According to still another aspect of the present invention, data is communicated 
from a source to a destination via a channel by obtaining initial binary data having a 
characteristic Hamming weight at the source, determining the characteristic Hamming 
weight of the initial binary data, performing a comparison of the characteristic Hamming 
weight of the initial binary data with a predetermined value, and processing the initial 
binary data based on the comparison to thereby develop processed binary data having a 
Hamming weight not less than the characteristic Hamming weight of the initial binary 
data. The processed binary data is then conveyed from the source to the destination via 
the channel, and the processed binary data is received from the source at the destination 
and the initial binary data is regenerated from the processed binary data. 

[0025] Communication decoding according to the present invention also may be 
performed by receiving processed binary data including a binary code word and an 
indication of whether bits of the binary code word are inverted and inverting each bit of 
the binary code word if the indication indicates that the bits are inverted. 

[0026] According to another aspect of the present invention, a communication 
encoding apparatus includes a data input for receiving initial binary data having a 
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characteristic Hamming weight and a processor in communication with the data input for 
determining the characteristic Hamming weight of the initial binary data, performing a 
comparison of the characteristic Hamming weight of the initial binary data with a 
predetermined value, and processing the initial binary data based on the comparison to 
thereby develop processed binary data having a Hamming weight not less than the 
characteristic Hamming weight of the initial binary data. 

[0027] According to another aspect of the present invention, a communication 
encoding apparatus comprises an input for receiving a first sequence of binary digits that 
collectively have a characteristic Hamming weight, a processor for inverting each of the 
binary digits in the first sequence of binary digits if the Hamming weight of the first 
sequence of binary digits is below a predetermined threshold Hamming weight value and 
providing an indication of whether the binary digits in the first sequence of binary digits 
have been inverted. 

[0028] According to yet another aspect of the present invention, a system for 
communicating a bitstream having a characteristic Hamming weight to a destination via a 
channel comprises a first processor at a source of the bitstream for determining the 
characteristic Hamming weight of the bitstream and for inverting each bit in the bitstream 
if the characteristic Hamming weight of the bitstream is below a threshold value and 
developing an indication of whether the bits in the bitstream are inverted and a second 
processor at the destination and in communication with the channel for receiving the 
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bitstream and the indication and inverting each bit in the bitstream at the destination if the 
indication indicates that the bits are inverted. The first processor is in communication 
with the channel to permit delivery of the bitstream and the indication of whether the bits 
in the bitstream are inverted to the destination via the channel. 

[0029] A communication decoder according to the present invention may include a 
memory for receiving processed binary data including a binary code word and an 
indication of whether bits of the binary code word are inverted and a processor for 
inverting each bit of the binary code word if the indication indicates that the bits are 
inverted. 

[0030] For carrying out the method of the present invention, a computer-readable 
medium may be provided having stored thereon a first set of machine-executable 
instructions for obtaining initial binary data having a characteristic Hamming weight, a 
second set of machine-executable instructions for determining the characteristic 
Hamming weight of the initial binary data, a third set of machine-executable instructions 
for performing a comparison of the characteristic Hamming weight of the initial binary 
data with a predetermined value, and a fourth set of machine-executable instructions for 
processing the initial binary data based on the comparison to thereby develop processed 
binary data having a Hamming weight not less than the characteristic Hamming weight of 
the initial binary data. 
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[0031] Alternatively, a computer-readable medium in accordance with the principles of 
the present invention may have stored thereon a first set of machine-executable 
instructions for obtaining a first sequence of binary digits that collectively have a 
characteristic Hamming weight, a second set of machine-executable instructions for 
inverting each of the binary digits in the first sequence of binary digits if the Hamming 
weight of the first sequence of binary digits is below a predetermined threshold Hamming 
weight value, and a third set of machine-executable instructions for providing an 
indication of whether the binary digits in the first sequence of binary digits have been 
inverted. 

"4 

f, [0032] In accordance with yet another aspect of the present invention, a computer- 

•yj readable medium may have machine-executable instructions stored thereon for 

Q communicating a bitstream having a characteristic Hamming weight to a destination via a 

ft 

M channel. More particularly, the machine-executable instructions may include a first set of 

Q machine-executable instructions for determining the characteristic Hamming weight of 

I1J 

the bitstream, a second set of machine-executable instructions for inverting each bit in the 
bitstream if the characteristic Hamming weight of the bitstream is below a threshold 
value and developing an indication of whether the bits in the bitstream are inverted, a 
third-set of machine-executable instructions for delivering the bitstream and the 
indication of whether the bits in the bitstream are inverted to the destination via the 
channel, and a fourth set of machine-executable instructions for inverting each bit in the 
bitstream at the destination if the indication indicates that the bits are inverted. 
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[00331 In accordance with another aspect of the present invention, a computer-readable 
medium may have stored thereon machine-executable instructions for communicating 
data from a source to a destination via a channel, the machine-executable instructions 
including a first set for obtaining initial binary data having a characteristic Hamming 
weight at the source, a second set for determining the characteristic Hamming weight of 
the initial binary data, a third set for performing a comparison of the characteristic 
Hamming weight of the initial binary data with a predetermined value, and a fourth set 
for processing the initial binary data based on the comparison to thereby develop 
processed binary data having a Hamming weight not less than the characteristic 
Hamming weight of the initial binary data. A fifth set of machine-executable instructions 
may be provided for conveying the processed binary data from the source to the 
destination via the channel. 

[0034] A disk drive in accordance with the principles of the present invention may 
include a data input for receiving initial binary data having a characteristic Hamming 
weight, and a processor in communication with the data input for determining the 
characteristic Hamming weight of the initial binary data, performing a comparison of the 
characteristic Hamming weight of the initial binary data with a predetermined value, and 
processing the initial binary data based on the comparison to thereby develop processed 
binary data having a Hamming weight not less than the characteristic Hamming weight of 
the initial binary data. 
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[0035] According to another aspect of the present invention, a disk drive includes an 
input for receiving a first sequence of binary digits that collectively have a characteristic 
Hamming weight, a processor for inverting each of the binary digits in the first sequence 
of binary digits if the Hamming weight of the first sequence of binary digits is below a 
predetermined threshold Hamming weight value and providing an indication of whether 
the binary digits in the first sequence of binary digits have been inverted. 

BRIEF DESCRIPTION OF THE DRAWING 

[0036] FIG. 1 is a block diagram illustrating one embodiment of a prior-art encoder for 
encoding user data; 

[0037] FIG. 2 is a block diagram illustrating a prior-art decoder for decoding 
information encoded by the encoder of FIG. 1; 

[0038] FIG. 3 is a block diagram illustrating a second embodiment of a prior-art 
encoder; 

[0039] FIG. 4 is a block diagram illustrating a prior-art decoder for decoding 
information encoded by the encoder of FIG. 3; 

[0040] FIG. 5 is a block diagram illustrating one exemplary embodiment of an encoder 
for encoding information in accordance with the method of the present invention; and 
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[0041] FIG, 6 is a block diagram illustrating one exemplary embodiment of a decoder 
for decoding information in accordance with the method of the present invention, such as, 
for example, information encoded by the encoder of FIG. 5. 

DETAILED DESCRIPTION 
[0042] Although the permuted ECC/RLL encoding scheme described above provides a 
highly effective technical solution, the possible industry acceptance problem that it may 
encounter, as also described above, cannot be ignored. Instead of risking loss of market 
share, it is far easier for a drive manufacturer to accept a different encoding scheme that 
provides nearly as much benefit as a permuted ECC/RLL scheme, without the attendant 
difficulty in educating potential customers in how to reliably assess the effectiveness of 
the encoding scheme used by the drive in correcting errors. 

[0043] The present invention provides one approach to improving the error-correction 
performance of an encoding/decoding scheme. Preferably, data is encoded using codes 
with small error propagation to enhance the effectiveness of the error correction coding 
(ECC) process. However, such codes generally have very low Hamming weights, such 
that a secondary encoding scheme is needed to enhance the Hamming weights of the 
codes while maintaining minimal loss in code rate. Such a secondary encoding scheme 
must improve the overall Hamming weight of the data sector to be communicated or 
stored without requiring successful scrambling. In accordance with the principles of the 
present invention, this secondary encoding scheme may employ a Hamming weight 
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encoder to enhance the Hamming weight of the data to be encoded prior to error 
correction coding of that data. Also in accordance with the principles of the present 
invention, the symbol boundary of an encoded symbol advantageously may be left 
unchanged relative to the error correction coding. 

[0044] One exemplary write-channel encoding apparatus 79 for encoding data in 
accordance with the principles of the present invention is illustrated in FIG. 5. The 
illustrated encoding apparatus 79 receives user data 80 comprising a stream of binary 
digits or bits of information. This stream of bits undergoes Hamming weight encoding 
(block 82), error correction coding (block 84), run length limit encoding (block 86), and 
precoding (block 88), and the resulting Hamming-weight-enhanced encoded bitstream is 
written to a conventional medium or a communication channel (block 90). 

[0045] A dotted line 92 interconnecting the encoding apparatus 79 of FIG. 5 with a 
read-channel decoding apparatus 93 (shown in FIG. 6) represents the media and/or the 
communication channel to which the encoded bit stream is written (block 90). 

[0046] As shown in FIG. 6, the decoding apparatus 93 reads the encoded bit stream 
from the media or channel 92 (block 94) and performs MLSD detection (block 96), 
reversed precoding (block 98), run length limit decoding (block 100), error correction 
decoding (block 102), and Hamming weight decoding (block 104) to thereby develop 
user data 106, which corresponds to the user data 80 processed by the encoding apparatus 
79 of FIG. 5. 
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[0047] The Hamming weight encoder preferably has knowledge about the data symbols 
that will not be RLL encoded by the RLL encoder. The user data 80, in the aggregate, is 
divided into several large groups of symbols. Each group is analyzed, and the total 
Hamming weight for the portion that is not to be RLL encoded is determined. This total 
Hamming weight is then compared with a predetermined threshold Hamming weight 
value (e.g., a value equivalent to 50 percent of the length of the group of bits of the user 
data 80). If the total Hamming weight is too small (i.e., is less than the predetermined 
threshold Hamming weight value), then all of the bits in this group are inverted. An 
additional bit or flag is appended at the end of the user data 80 to provide an indication of 
whether or not the bits in the group have been inverted. As will be readily apparent to 
those of ordinary skill in the art, this procedure ensures that the group of bits of data to be 
encoded has a Hamming weight that is at least as great as 50% of the size of the group. 

[0048] By using groups of a relatively large size (e.g., on the order of hundreds of bits 
per group), the total number of extra "code inversion tracking" or "flag" bits appended by 
the Hamming weight encoder (and the percentage of the total encoded bitstream occupied 
by those flag bits) can be minimized. This ensures that the performance of the channel is 
not significantly degraded by code rate loss. Of course, it is possible that the Hamming 
weight encoder 82 looks at the Hamming weight of the whole group including the 
encoded code word. Such an approach is not preferable, but it does not materially affect 
the performance of the encoding system, because the RLL encoding is designed to 
provide a minimal number of RLL-encoded symbols per code word. It is also possible 
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that the Hamming weight encoder 82 looks at only some of the unencoded symbols as 
opposed to all of them, as well as that encoded symbols are also included in the 
Hamming weight calculation and Hamming weight encoding. Again this is not 
preferable but it may be done in the interest of increasing the total Hamming weight of 
the data written to the media or communication channel. 

[0049] On the read back side (i.e. , in the decoder 93), the ECC process is assumed to 
have fully corrected all the errors such that all of the code inversion tracking bits may, in 
turn, be assumed to be correct. As a consequence, the Hamming weight decoder 104 can 
reliably invert the symbols that have been inverted by the Hamming weight encoding 
process. 

[0050] As an illustrative example, user data 80 may be organized in 10-bit symbols Ul, 
U2, U3, U4, . . . Un. An RLL code is employed such that for each 3 -symbol sequence 
{ul,u2,u3}, ul is encoded to El and separated into el a and elb; u2 and u3 are 
unencoded; and the final RLL code word is constructed as{ela, u2, elb, u3}. Thus, U2, 
U3, U5, U6, U8, and U9, and so on, will not be encoded. The Hamming weight encoder 
82 may choose to operate over a group of, for example, 51 symbols. The total number of 
symbols that will not be RLL-encoded in this group is then 34. The Hamming weight 
encoder takes the sum of the one-valued bits in this 34 symbols, and, if the sum is less 
than 34*10/2 = 170, then all of the bits in the 34 symbols will be inverted. A one-valued 
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bit is then appended at the end of the user data 80 to indicate that the group has been 
inverted. If the group is not inverted, a zero- valued bit is appended. 

[0051] The Hamming weight encoder 82 repeats this procedure for all groups of bits in 
the user data 80. 

[0052] Although the user data 80 may not be divisible into Hamming weight encoding 
groups of equal size in every instance, this does not affect the performance of the 
Hamming weight encoding method of the present invention. However, it is preferable for 
all of the Hamming weight encoding groups to have equal size, if possible, so that the 
Hamming weight decoder 104 need only keep track of the typical group size and the last 
group size (which may differ due to a remainder of bits following division of the user 
data 80 into a number of groups of equal size). It is also possible for small groups of data 
to remain unencoded as long as the controller keeps track of the encoding format applied 
so that the decoding process can be done correctly. Although scrambling of data is more 
commonly done after ECC encoding, in accordance with the principles of the present 
invention, it is preferable, but not absolutely necessary, that scrambling be done before 
Hamming weight encoding. 

[0053] While the present invention will be described with respect to an encoder for use 
in general purpose computers (e.g., in disk drives, printers, routers, etc.), it is to be 
understood that the present invention may also find applicability in other noisy channels 
(e.g., wireless, etc.) and even in other fields such as Internet communications, 
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telecommunications, or any processor-to-processor applications. Also, rather than being 
embodied in discrete card, the method and apparatus of the present invention 
alternatively may advantageously be incorporated directly into a computer "mother 
board" or any other suitable hardware configuration, if desired. 

[0054] The foregoing description is for the purpose of teaching those skilled in the art 
the best mode of carrying out the invention and is to be construed as illustrative only. 
Numerous modifications and alternative embodiments of the invention will be apparent 
to those skilled in the art in view of this description, and the details of the disclosed 
structure may be varied substantially without departing from the spirit of the invention. 
Accordingly, the exclusive use of all modifications within the scope of the appended 
claims is reserved. 
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